Overview

Dataset statistics

Number of variables22
Number of observations301666
Missing cells154566
Missing cells (%)2.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory42.6 MiB
Average record size in memory148.0 B

Variable types

CAT10
NUM8
BOOL4

Warnings

Latitude has 77283 (25.6%) missing values Missing
Longitude has 77283 (25.6%) missing values Missing
Unnamed: 0 has unique values Unique
day_of_week has 39355 (13.0%) zeros Zeros
hour has 23282 (7.7%) zeros Zeros
minute has 39858 (13.2%) zeros Zeros

Reproduction

Analysis started2021-02-05 14:53:30.157689
Analysis finished2021-02-05 14:54:07.357186
Duration37.2 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

Unnamed: 0
Real number (ℝ≥0)

UNIQUE

Distinct301666
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean338338.5259
Minimum0
Maximum660610
Zeros1
Zeros (%)< 0.1%
Memory size2.3 MiB
2021-02-05T14:54:07.596958image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile31470.25
Q1167073.25
median347579.5
Q3516833.75
95-th percentile632514.75
Maximum660610
Range660610
Interquartile range (IQR)349760.5

Descriptive statistics

Standard deviation195434.7634
Coefficient of variation (CV)0.5776308296
Kurtosis-1.21808208
Mean338338.5259
Median Absolute Deviation (MAD)174880.5
Skewness-0.06097040379
Sum1.020652297e+11
Variance3.819474673e+10
MonotocityStrictly increasing
2021-02-05T14:54:07.779615image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
20471< 0.1%
 
4555631< 0.1%
 
684681< 0.1%
 
664211< 0.1%
 
725661< 0.1%
 
705191< 0.1%
 
6268201< 0.1%
 
6214341< 0.1%
 
6193871< 0.1%
 
848601< 0.1%
 
Other values (301656)301656> 99.9%
 
ValueCountFrequency (%) 
01< 0.1%
 
11< 0.1%
 
21< 0.1%
 
31< 0.1%
 
41< 0.1%
 
ValueCountFrequency (%) 
6606101< 0.1%
 
6606091< 0.1%
 
6606081< 0.1%
 
6606071< 0.1%
 
6606061< 0.1%
 

Type
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
person search
230152 
person and vehicle search
71401 
vehicle search
 
113
ValueCountFrequency (%) 
person search23015276.3%
 
person and vehicle search7140123.7%
 
vehicle search113< 0.1%
 
2021-02-05T14:54:07.961673image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2021-02-05T14:54:08.083996image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:08.171732image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length25
Median length13
Mean length15.84064164
Min length13
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
missing
148496 
False
137070 
True
16100 
ValueCountFrequency (%) 
missing14849649.2%
 
False13707045.4%
 
True161005.3%
 
2021-02-05T14:54:08.335682image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2021-02-05T14:54:08.464965image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:08.557195image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length7
Median length5
Mean length5.931135759
Min length4

Latitude
Real number (ℝ≥0)

MISSING

Distinct78337
Distinct (%)34.9%
Missing77283
Missing (%)25.6%
Infinite0
Infinite (%)0.0%
Mean52.51750653
Minimum49.892149
Maximum57.143856
Zeros0
Zeros (%)0.0%
Memory size2.3 MiB
2021-02-05T14:54:08.757253image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum49.892149
5-th percentile50.8307412
Q151.4989
median52.617842
Q353.424474
95-th percentile54.548465
Maximum57.143856
Range7.251707
Interquartile range (IQR)1.925574

Descriptive statistics

Standard deviation1.131710504
Coefficient of variation (CV)0.02154920481
Kurtosis-0.9270221553
Mean52.51750653
Median Absolute Deviation (MAD)0.896118
Skewness0.1884210566
Sum11784035.67
Variance1.280768665
MonotocityNot monotonic
2021-02-05T14:54:08.947376image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
51.40274712430.4%
 
53.4039145020.2%
 
51.5412643360.1%
 
51.627523350.1%
 
53.4045632980.1%
 
53.4775122870.1%
 
51.4076652700.1%
 
53.4079392590.1%
 
53.4021222350.1%
 
53.0464032230.1%
 
Other values (78327)22039573.1%
 
(Missing)7728325.6%
 
ValueCountFrequency (%) 
49.8921491< 0.1%
 
49.9222991< 0.1%
 
49.9524641< 0.1%
 
49.9593581< 0.1%
 
50.0817651< 0.1%
 
ValueCountFrequency (%) 
57.1438563< 0.1%
 
56.4575311< 0.1%
 
56.3938532< 0.1%
 
55.98461< 0.1%
 
55.9524969< 0.1%
 

Longitude
Real number (ℝ)

MISSING

Distinct78711
Distinct (%)35.1%
Missing77283
Missing (%)25.6%
Infinite0
Infinite (%)0.0%
Mean-1.339926363
Minimum-8.053397
Maximum1.75648
Zeros0
Zeros (%)0.0%
Memory size2.3 MiB
2021-02-05T14:54:09.168715image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum-8.053397
5-th percentile-3.028236
Q1-2.6047225
median-1.45732
Q3-0.204165
95-th percentile0.937354
Maximum1.75648
Range9.809877
Interquartile range (IQR)2.4005575

Descriptive statistics

Standard deviation1.368559346
Coefficient of variation (CV)-1.021369072
Kurtosis-0.6463855179
Mean-1.339926363
Median Absolute Deviation (MAD)1.206935
Skewness0.1067613875
Sum-300656.6971
Variance1.872954683
MonotocityNot monotonic
2021-02-05T14:54:09.310215image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
-0.50981312430.4%
 
-2.9814995020.2%
 
-0.0033383360.1%
 
-0.7490933350.1%
 
-2.9824012970.1%
 
-2.2265862860.1%
 
-0.5126152700.1%
 
-2.9773642590.1%
 
-2.9808262350.1%
 
-2.1947672230.1%
 
Other values (78701)22039773.1%
 
(Missing)7728325.6%
 
ValueCountFrequency (%) 
-8.05339714< 0.1%
 
-8.0118061< 0.1%
 
-7.986535< 0.1%
 
-7.98078511< 0.1%
 
-7.971461< 0.1%
 
ValueCountFrequency (%) 
1.756481< 0.1%
 
1.756432< 0.1%
 
1.756171< 0.1%
 
1.7560725< 0.1%
 
1.7559271< 0.1%
 

Gender
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
male
270765 
female
30636 
other
 
265
ValueCountFrequency (%) 
male27076589.8%
 
female3063610.2%
 
other2650.1%
 
2021-02-05T14:54:09.443709image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2021-02-05T14:54:09.525048image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:09.628854image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length4
Mean length4.203990506
Min length4

Age range
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
18-24
102645 
25-34
73119 
over 34
66063 
10-17
59534 
under 10
 
305
ValueCountFrequency (%) 
18-2410264534.0%
 
25-347311924.2%
 
over 346606321.9%
 
10-175953419.7%
 
under 103050.1%
 
2021-02-05T14:54:09.754642image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2021-02-05T14:54:09.838001image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:09.945884image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length8
Median length5
Mean length5.441020864
Min length5
Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
white
238002 
black
31776 
asian
24233 
other
 
5839
mixed
 
1816
ValueCountFrequency (%) 
white23800278.9%
 
black3177610.5%
 
asian242338.0%
 
other58391.9%
 
mixed18160.6%
 
2021-02-05T14:54:10.061815image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2021-02-05T14:54:10.144453image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:10.272793image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length5
Median length5
Mean length5
Min length5

Legislation
Categorical

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
misuse of drugs act 1971 (section 23)
176838 
police and criminal evidence act 1984 (section 1)
91822 
missing
27420 
criminal justice and public order act 1994 (section 60)
 
2736
firearms act 1968 (section 47)
 
1849
Other values (13)
 
1001
ValueCountFrequency (%) 
misuse of drugs act 1971 (section 23)17683858.6%
 
police and criminal evidence act 1984 (section 1)9182230.4%
 
missing274209.1%
 
criminal justice and public order act 1994 (section 60)27360.9%
 
firearms act 1968 (section 47)18490.6%
 
criminal justice act 1988 (section 139b)6800.2%
 
poaching prevention act 1862 (section 2)148< 0.1%
 
psychoactive substances act 2016 (s36(2))90< 0.1%
 
wildlife and countryside act 1981 (section 19)31< 0.1%
 
police and criminal evidence act 1984 (section 6)15< 0.1%
 
Other values (8)37< 0.1%
 
2021-02-05T14:54:10.403774image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique3 ?
Unique (%)< 0.1%
2021-02-05T14:54:10.538223image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length55
Median length37
Mean length38.05770289
Min length7

Object of search
Categorical

Distinct16
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
controlled drugs
189975 
offensive weapons
35356 
article for use in theft
30030 
stolen goods
26088 
articles for use in criminal damage
 
6476
Other values (11)
 
13741
ValueCountFrequency (%) 
controlled drugs18997563.0%
 
offensive weapons3535611.7%
 
article for use in theft3003010.0%
 
stolen goods260888.6%
 
articles for use in criminal damage64762.1%
 
anything to threaten or harm anyone52041.7%
 
firearms29441.0%
 
evidence of offences under the act19070.6%
 
fireworks17100.6%
 
psychoactive substances17010.6%
 
Other values (6)2750.1%
 
2021-02-05T14:54:10.690436image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2021-02-05T14:54:10.871513image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length42
Median length16
Mean length17.35245271
Min length8
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size294.6 KiB
False
204070 
True
97596 
ValueCountFrequency (%) 
False20407067.6%
 
True9759632.4%
 
2021-02-05T14:54:10.988283image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size294.6 KiB
False
291332 
True
 
10334
ValueCountFrequency (%) 
False29133296.6%
 
True103343.4%
 
2021-02-05T14:54:11.034657image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

station
Categorical

Distinct41
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
merseyside
40864 
essex
 
18813
thames-valley
 
17647
west-yorkshire
 
16548
hampshire
 
13357
Other values (36)
194437 
ValueCountFrequency (%) 
merseyside4086413.5%
 
essex188136.2%
 
thames-valley176475.8%
 
west-yorkshire165485.5%
 
hampshire133574.4%
 
south-yorkshire131314.4%
 
hertfordshire129284.3%
 
kent128784.3%
 
surrey106353.5%
 
avon-and-somerset97003.2%
 
Other values (31)13516544.8%
 
2021-02-05T14:54:11.156585image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2021-02-05T14:54:11.376055image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length18
Median length10
Mean length10.61584004
Min length3

target
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size294.6 KiB
False
241105 
True
60561 
ValueCountFrequency (%) 
False24110579.9%
 
True6056120.1%
 
2021-02-05T14:54:11.705254image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size294.6 KiB
False
211580 
True
90086 
ValueCountFrequency (%) 
False21158070.1%
 
True9008629.9%
 
2021-02-05T14:54:11.766302image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

ethnicity
Categorical

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
white
212777 
missing
40705 
black
 
20362
asian
 
17947
mixed
 
8039
ValueCountFrequency (%) 
white21277770.5%
 
missing4070513.5%
 
black203626.7%
 
asian179475.9%
 
mixed80392.7%
 
other18360.6%
 
2021-02-05T14:54:11.849075image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2021-02-05T14:54:11.975525image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:12.097095image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length7
Median length5
Mean length5.269868
Min length5

year
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
2019
184070 
2018
117596 
ValueCountFrequency (%) 
201918407061.0%
 
201811759639.0%
 
2021-02-05T14:54:12.291894image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2021-02-05T14:54:12.434390image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:12.519644image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length4
Median length4
Mean length4
Min length4

month
Real number (ℝ≥0)

Distinct12
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.844344407
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Memory size2.3 MiB
2021-02-05T14:54:12.676544image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median7
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.437626734
Coefficient of variation (CV)0.5022580001
Kurtosis-1.20347789
Mean6.844344407
Median Absolute Deviation (MAD)3
Skewness-0.1367373238
Sum2064706
Variance11.81727756
MonotocityNot monotonic
2021-02-05T14:54:12.830132image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%) 
103090210.2%
 
11297599.9%
 
12269848.9%
 
8262678.7%
 
9258248.6%
 
5244768.1%
 
7240778.0%
 
6239657.9%
 
4239147.9%
 
3229877.6%
 
Other values (2)4251114.1%
 
ValueCountFrequency (%) 
1223717.4%
 
2201406.7%
 
3229877.6%
 
4239147.9%
 
5244768.1%
 
ValueCountFrequency (%) 
12269848.9%
 
11297599.9%
 
103090210.2%
 
9258248.6%
 
8262678.7%
 

day
Real number (ℝ≥0)

Distinct31
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.77049452
Minimum1
Maximum31
Zeros0
Zeros (%)0.0%
Memory size2.3 MiB
2021-02-05T14:54:12.999182image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q18
median16
Q323
95-th percentile29
Maximum31
Range30
Interquartile range (IQR)15

Descriptive statistics

Standard deviation8.741345795
Coefficient of variation (CV)0.554284825
Kurtosis-1.175055221
Mean15.77049452
Median Absolute Deviation (MAD)7
Skewness0.003070489734
Sum4757422
Variance76.41112631
MonotocityNot monotonic
2021-02-05T14:54:13.144848image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%) 
13104023.4%
 
20103463.4%
 
9102073.4%
 
18101943.4%
 
23101503.4%
 
21101273.4%
 
17100983.3%
 
12100943.3%
 
19100853.3%
 
22100413.3%
 
Other values (21)19992266.3%
 
ValueCountFrequency (%) 
196783.2%
 
293813.1%
 
397923.2%
 
495383.2%
 
599513.3%
 
ValueCountFrequency (%) 
3161372.0%
 
3088372.9%
 
2989233.0%
 
2897573.2%
 
2795293.2%
 

day_of_week
Real number (ℝ≥0)

ZEROS

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.064080805
Minimum0
Maximum6
Zeros39355
Zeros (%)13.0%
Memory size2.3 MiB
2021-02-05T14:54:13.267097image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q35
95-th percentile6
Maximum6
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.954546439
Coefficient of variation (CV)0.6378899787
Kurtosis-1.206874938
Mean3.064080805
Median Absolute Deviation (MAD)2
Skewness-0.0713532846
Sum924329
Variance3.820251782
MonotocityNot monotonic
2021-02-05T14:54:13.358279image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%) 
54802915.9%
 
44785115.9%
 
34359814.5%
 
24274314.2%
 
14080813.5%
 
03935513.0%
 
63928213.0%
 
ValueCountFrequency (%) 
03935513.0%
 
14080813.5%
 
24274314.2%
 
34359814.5%
 
44785115.9%
 
ValueCountFrequency (%) 
63928213.0%
 
54802915.9%
 
44785115.9%
 
34359814.5%
 
24274314.2%
 

hour
Real number (ℝ≥0)

ZEROS

Distinct24
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.53988517
Minimum0
Maximum23
Zeros23282
Zeros (%)7.7%
Memory size2.3 MiB
2021-02-05T14:54:13.583749image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q19
median15
Q320
95-th percentile23
Maximum23
Range23
Interquartile range (IQR)11

Descriptive statistics

Standard deviation7.385004255
Coefficient of variation (CV)0.5454259147
Kurtosis-0.89783832
Mean13.53988517
Median Absolute Deviation (MAD)5
Skewness-0.5640549754
Sum4084523
Variance54.53828784
MonotocityNot monotonic
2021-02-05T14:54:13.697366image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%) 
23271059.0%
 
0232827.7%
 
22177125.9%
 
21169105.6%
 
20168895.6%
 
19167145.5%
 
15164405.4%
 
16161295.3%
 
14160195.3%
 
18159265.3%
 
Other values (14)11854039.3%
 
ValueCountFrequency (%) 
0232827.7%
 
1144134.8%
 
298873.3%
 
370362.3%
 
442911.4%
 
ValueCountFrequency (%) 
23271059.0%
 
22177125.9%
 
21169105.6%
 
20168895.6%
 
19167145.5%
 

minute
Real number (ℝ≥0)

ZEROS

Distinct60
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.81117527
Minimum0
Maximum59
Zeros39858
Zeros (%)13.2%
Memory size2.3 MiB
2021-02-05T14:54:13.820763image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15
median22
Q340
95-th percentile55
Maximum59
Range59
Interquartile range (IQR)35

Descriptive statistics

Standard deviation18.4997296
Coefficient of variation (CV)0.7769347538
Kurtosis-1.256784146
Mean23.81117527
Median Absolute Deviation (MAD)17
Skewness0.2071741977
Sum7183022
Variance342.2399952
MonotocityNot monotonic
2021-02-05T14:54:13.953900image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
03985813.2%
 
1214767.1%
 
30214737.1%
 
45128534.3%
 
15128054.2%
 
20125644.2%
 
50119093.9%
 
10114643.8%
 
40113743.8%
 
579822.6%
 
Other values (50)13790845.7%
 
ValueCountFrequency (%) 
03985813.2%
 
1214767.1%
 
227300.9%
 
326090.9%
 
426010.9%
 
ValueCountFrequency (%) 
5921550.7%
 
5824230.8%
 
5722520.7%
 
5622890.8%
 
5576482.5%
 

Interactions

2021-02-05T14:53:52.795572image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:52.951872image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:53.109517image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:53.255167image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:53.402699image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:53.552218image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:53.707685image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:53.853428image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:54.006426image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:54.170700image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:54.344862image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:54.493448image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:54.643244image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:54.801079image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:54.968772image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:55.129139image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:55.300801image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:55.489805image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:55.651875image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:55.808433image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:55.961178image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:56.114876image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:56.267746image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:56.410222image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:56.558576image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:56.713061image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:56.869725image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:57.074599image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:57.221163image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:57.366883image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:57.530570image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:58.081083image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:58.232777image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:58.392783image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:58.548900image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:58.695038image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:58.892696image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:59.131341image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:59.291125image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:59.448258image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:59.605482image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:59.760570image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:53:59.922605image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:00.068796image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:00.215681image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:00.363294image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:00.530351image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:00.685702image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:00.845738image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:01.005240image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:01.160244image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:01.317654image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:01.456170image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:01.620922image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:01.831048image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:01.973734image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:02.128610image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:02.281919image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:02.478982image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:02.634749image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:02.793441image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:02.942643image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:03.094924image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:03.239308image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Correlations

2021-02-05T14:54:14.130364image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2021-02-05T14:54:14.359838image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2021-02-05T14:54:14.584032image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2021-02-05T14:54:14.811430image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.
2021-02-05T14:54:15.119139image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

2021-02-05T14:54:03.854399image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:05.139679image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:06.456392image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2021-02-05T14:54:06.877839image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Sample

First rows

Unnamed: 0TypePart of a policing operationLatitudeLongitudeGenderAge rangeOfficer-defined ethnicityLegislationObject of searchOutcome linked to object of searchRemoval of more than just outer clothingstationtargetOutcome_trueethnicityyearmonthdayday_of_weekhourminute
00person searchTrueNaNNaNmale18-24asianmisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalseasian2019121600
11person searchTrueNaNNaNmale18-24whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalsemissing2019121609
22person searchTrueNaNNaNfemale18-24whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalsewhite20191216010
33person searchFalseNaNNaNmale18-24asianmisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalsemissing20191216010
44person searchTrue50.368247-4.126646male18-24whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalsemissing20191216012
55person searchTrueNaNNaNmale18-24whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalsewhite20191216013
66person searchTrueNaNNaNmale25-34whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalsewhite20191216016
77person searchTrueNaNNaNmale18-24whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalsewhite20191216025
88person searchTrueNaNNaNmale18-24blackmisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalseblack20191216025
99person searchTrueNaNNaNmale25-34blackmisuse of drugs act 1971 (section 23)controlled drugsFalseFalsedevon-and-cornwallFalseFalseblack20191216035

Last rows

Unnamed: 0TypePart of a policing operationLatitudeLongitudeGenderAge rangeOfficer-defined ethnicityLegislationObject of searchOutcome linked to object of searchRemoval of more than just outer clothingstationtargetOutcome_trueethnicityyearmonthdayday_of_weekhourminute
301656660601person and vehicle searchFalse51.156811-1.859133female25-34whitemisuse of drugs act 1971 (section 23)controlled drugsTrueFalsewiltshireTrueTruewhite20188255198
301657660602person searchFalse51.446286-2.013650male18-24whitepolice and criminal evidence act 1984 (section 1)offensive weaponsFalseFalsewiltshireFalseFalsewhite20188266100
301658660603person searchFalse51.720270-1.953499female10-17whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsewiltshireFalseFalsemissing201882662230
301659660604person searchmissingNaNNaNmale25-34whitepolice and criminal evidence act 1984 (section 1)article for use in theftFalseFalsewiltshireFalseFalsewhite201882701950
301660660605person searchmissingNaNNaNfemale25-34whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsewiltshireFalseTruewhite201882812225
301661660606person searchmissingNaNNaNmale18-24whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsewiltshireFalseFalsewhite20188292245
301662660607person and vehicle searchFalse51.540219-1.764708male18-24whitemisuse of drugs act 1971 (section 23)controlled drugsTrueFalsewiltshireTrueTruewhite20188292210
301663660608person searchFalse51.540219-1.764708male18-24whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsewiltshireFalseFalsewhite201882922110
301664660609person searchFalse51.540219-1.764708male18-24whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsewiltshireFalseFalsewhite201882922115
301665660610person searchmissingNaNNaNfemaleover 34whitemisuse of drugs act 1971 (section 23)controlled drugsFalseFalsewiltshireFalseTruewhite201883031315